CDS

Accession Number TCMCG019C14800
gbkey CDS
Protein Id XP_022943635.1
Location complement(join(5595139..5596662,5596765..5596899,5597064..5597117,5597216..5597293,5597931..5598013,5598098..5598143,5598318..5598437,5598739..5599047))
Gene LOC111448345
GeneID 111448345
Organism Cucurbita moschata

Protein

Length 782aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA418582
db_source XM_023087867.1
Definition splicing factor 1-like isoform X2 [Cucurbita moschata]

EGGNOG-MAPPER Annotation

COG_category A
Description K homology RNA-binding domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko03041        [VIEW IN KEGG]
KEGG_ko ko:K13095        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGAGTGCAGAGGTTGAAAAGGCATCTTGTATCGAGTCTAGAAGTGCAAAGATGTCTGGAGCAACTGTTACTTCTGCTGCACCTATGGGAAGCCAAAAGGTTTCCATGTTTGCAGCAAAGACTGGGTTTGTTATACCAAAAAACAAACTTTCAGGGTCTTTGGTTCCCATCTTCCGAGTGAACAAAAAGTTGGGAGGGAATGAATCCGCTAATGGAGAAAATGTGAAACAGACCCAAAGAAATACAAAGTGGGGTCCTGATTTAACACAGGATACTGCTGTCAGAAAGGGGAGGCTCATAGCTTATCAGACTCGATTGGAACAAATCAGGGTACTCCTTAAATCTGGAACTTTGGAGGTTCCAAAGACACAAGTTTCTGAAGCTGAGAATGTGGATGATAATTCCCCTGGACCTCAAGGGAATAATAAGGCGCTGAACAATGAACTTTTGGAACTTGAAAAACGTGAAGTTATTGGTGAAATACTAAAACTGAATCCAAGTTATAAGGCCCCTCCTGATTATAGGCCCTTGTTGAAAGAGGACAGCTTACCTCTCCCGGTTAAAGAATATCCTGGTTACAACTTTATTGGCTTAATATATGGCCCCAGTGGTGAAAATCAAAAGCGATTAGAAAAGGAGACTGGAGCCAAAATACGAATTTGCGGCATTAAAGCAGGGACAGGTGAAAAGGATGAAATTAAACCAACTGATGTACACGAAACTCAGAACGCTTATGAAGAGCTGTACGTTTGCATGTCAGCTGATACATTTGATAAGATTGATGCTGCAATTTCTGTTATTGAACTCCTAATCACCTCAATATCGGGAAATCTGGCCACTGGCTCCACATTGTCTGACTTGGTTTCTACGGAGGAAAGTTCTTCCAGCCGAGCCGAGGGTACTACAGTCTCAAATATGGGGCAGACTCCTGTGCCGAACCAGGGGGTTATGCATCAACTACAAGTTTATGCGCCAACTCCGATGCAAGGCCAGTTTCGTTATCCTAGTGCATGGCCTTCTCACAATTTACCGCCTGCTCCTGCATTTATTTCCCCACAAGATCCTCCGTCATCATTTTCTCGTCCACCTGCTCCAGTTGCTTTCAATCCAGCTTTCCGGGGCCCTCCTGTTCCTCCTCCAAGACAGCAGTTTCCTGCACAGGACTTGCAGCAACCTTTCATGACTCAAACCAGTCACGTTGGCCAACCCAGAGTAAATGCTTTGACAGTTCAACGCCCCTCATTGGTTCCTTCTAATGTCTCAAATCCAAACTTCACTGGTAGTGGTCAATTACCTTCAGGACCACTCCCGAATATGCCAGGATCATCAATTCCCTCAGCTTTGCCTCAACTTGTTCCTGGTAGCATTCCTCCTGGACCACGGCCTGACCGTCCATTAGCACCTAGCATAGTTTCTACTGGTTTTTCTGGTCCCGCAGTTGGCAGCTCAGCATCTATGGGTCCAAATAACATGGGGCAGATGGCTCTATCGATTGCCCCACCCTTTCTGCCTCGTGCAGCTCCACCGCATGGTGTTAATTCTTCTGGCACAGCATCTGCAAATGCAGCAGTAGCCAATGTAGATGGATATGCATCTTTTACTTCTGGGCCGCCCACCCCCCAAGCTATGAGTATACATAAAAATCACCCTATTACACCTCCAATTCCGTCACCCCAGATGGGGCATCGCCCACTATTTGCAGCACATAATCCTGCTGGTAACTTCATTGCTGGATCTGCTTCAACCCCTCCAACACCACCTACCAATACCAGCAATTTTACATTCCAACCACGTGGTCCACAAAATCCATCTCCTCATACAATTCTGAATTTGAACATTCAAAACACACCTACCGTACCTACATTGCAACAGCCTGCATCTGGGGCGCCATCTTTCCATCCAGCAGCCCCAAATTTTATGAGAGCTGCCAATCAACCCTTTCCCGGACCTCAAGCTGGCAGCCAGATAGGTAATCATCAAATTCAAGAGGTAGCTTCAAATCCTATTGGCATGCAGGTCTCGGCTAGGATTCCTGCTTTCCTCGATCAAGGTCCTCGAACACAACTGCATCAAGGAAACTTTAGTCCAGCCATGCAAATGCAAATGCAAATGCCGAACTTGCCACGCAATTTTACTCACAGACCCGGGAATGCCATGCAACTTGAACAATGTTTCCCCATGCGAGCTCCTCGACCTGAAGTCCGCTTTACTCCCCCACGGTACAGTAGCAATCTGGCGTTTGTTTCTGGTAGGCCACCTCCCATTTCCGGTGGGCAGCAAGTTTATGATCCATTCTCGCCTACATCTGTAGCTGGTACACAACAGCAGGGGAGCAATCCGCCAAGGTGA
Protein:  
MSAEVEKASCIESRSAKMSGATVTSAAPMGSQKVSMFAAKTGFVIPKNKLSGSLVPIFRVNKKLGGNESANGENVKQTQRNTKWGPDLTQDTAVRKGRLIAYQTRLEQIRVLLKSGTLEVPKTQVSEAENVDDNSPGPQGNNKALNNELLELEKREVIGEILKLNPSYKAPPDYRPLLKEDSLPLPVKEYPGYNFIGLIYGPSGENQKRLEKETGAKIRICGIKAGTGEKDEIKPTDVHETQNAYEELYVCMSADTFDKIDAAISVIELLITSISGNLATGSTLSDLVSTEESSSSRAEGTTVSNMGQTPVPNQGVMHQLQVYAPTPMQGQFRYPSAWPSHNLPPAPAFISPQDPPSSFSRPPAPVAFNPAFRGPPVPPPRQQFPAQDLQQPFMTQTSHVGQPRVNALTVQRPSLVPSNVSNPNFTGSGQLPSGPLPNMPGSSIPSALPQLVPGSIPPGPRPDRPLAPSIVSTGFSGPAVGSSASMGPNNMGQMALSIAPPFLPRAAPPHGVNSSGTASANAAVANVDGYASFTSGPPTPQAMSIHKNHPITPPIPSPQMGHRPLFAAHNPAGNFIAGSASTPPTPPTNTSNFTFQPRGPQNPSPHTILNLNIQNTPTVPTLQQPASGAPSFHPAAPNFMRAANQPFPGPQAGSQIGNHQIQEVASNPIGMQVSARIPAFLDQGPRTQLHQGNFSPAMQMQMQMPNLPRNFTHRPGNAMQLEQCFPMRAPRPEVRFTPPRYSSNLAFVSGRPPPISGGQQVYDPFSPTSVAGTQQQGSNPPR